fix(libutil/tarfile): normalize legacy HTTP Content-Encoding names #14417

lovesegfault · 2025-10-29T17:32:28Z

Motivation

Nix failed to download files served with Content-Encoding: x-gzip
because libarchive doesn't recognize the legacy x-* compression
format names. Per RFC 9110 §8.4.1.3, HTTP recipients should treat
these as equivalent to their standard counterparts.

Adds normalizeCompressionMethod() to map legacy encoding names
before passing to libarchive:

x-gzip → gzip
x-compress → compress
x-bzip2 → bzip2
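
The mapping above could be sketched roughly as follows. This is a minimal illustration, not the code from the PR: only the function name `normalizeCompressionMethod` comes from the description, while the signature and return type are assumptions.

```cpp
#include <string>
#include <string_view>

// Hypothetical sketch: the parameter and return types are assumed,
// only the function name is taken from the PR description.
std::string normalizeCompressionMethod(std::string_view method)
{
    // Map the legacy x-* names listed in the PR to their standard counterparts.
    if (method == "x-gzip")
        return "gzip";
    if (method == "x-compress")
        return "compress";
    if (method == "x-bzip2")
        return "bzip2";
    // Anything else is passed through unchanged.
    return std::string(method);
}
```

Under this sketch a caller would normalize the header value once, before handing the name to libarchive, so that names already in standard form are unaffected.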

Context

Fixes: #14324

xokdvium

I think that what we should do is stop using strings to represent enumeration types. We conflate the compression algorithm names used by libarchive with our non-standard Content-Encoding headers. Those need to become clearly separated.

Mic92 · 2025-10-29T18:21:20Z

I think that what we should do is stop using strings to represent enumeration types. We conflate the compression algorithm names used by libarchive with our non-standard Content-Encoding headers. Those need to become clearly separated.

you mean libarchive has enum types for compression?

xokdvium · 2025-10-29T18:25:21Z

libarchive has enum types for compression?

It doesn't unfortunately, but we really should have our own to wrap around libarchive.

Ericson2314 · 2025-10-29T21:09:23Z

I agree that making our own enum sounds like the right call.

tomberek

Can refactor into enum in another PR.

xokdvium

Note that this must only affect Content-Encoding parsing, e.g. it must not be possible to specify the deprecated name as the store parameter. Since there's currently no distinction in the code, it's a no-go IMO.
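
One way the separation discussed in this thread might look is sketched below. Everything here is hypothetical: the enum `CompressionAlgo` and the functions `parseCompressionAlgo`, `parseContentEncoding`, and `toLibarchiveName` are illustrative names, not taken from the Nix codebase or from the WIP commits mentioned later. The idea is that only the Content-Encoding parser accepts the legacy aliases, while the strict parser used for settings rejects them.

```cpp
#include <optional>
#include <string_view>

// Hypothetical enum wrapping the algorithm names passed to libarchive.
enum class CompressionAlgo { Gzip, Compress, Bzip2 };

// Strict parser (e.g. for a store parameter): canonical names only.
std::optional<CompressionAlgo> parseCompressionAlgo(std::string_view name)
{
    if (name == "gzip")
        return CompressionAlgo::Gzip;
    if (name == "compress")
        return CompressionAlgo::Compress;
    if (name == "bzip2")
        return CompressionAlgo::Bzip2;
    return std::nullopt;
}

// Lenient parser for HTTP Content-Encoding values: additionally
// accepts the legacy x-* aliases listed in the PR description.
std::optional<CompressionAlgo> parseContentEncoding(std::string_view value)
{
    if (value == "x-gzip")
        return CompressionAlgo::Gzip;
    if (value == "x-compress")
        return CompressionAlgo::Compress;
    if (value == "x-bzip2")
        return CompressionAlgo::Bzip2;
    return parseCompressionAlgo(value);
}

// Convert back to the string name handed to libarchive.
std::string_view toLibarchiveName(CompressionAlgo algo)
{
    switch (algo) {
    case CompressionAlgo::Gzip:
        return "gzip";
    case CompressionAlgo::Compress:
        return "compress";
    case CompressionAlgo::Bzip2:
        return "bzip2";
    }
    return ""; // not reached for valid enum values
}
```

With this split, `parseCompressionAlgo("x-gzip")` yields no value, so a deprecated name cannot leak into configuration, while `parseContentEncoding("x-gzip")` still resolves to gzip.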

lovesegfault · 2025-11-10T17:40:11Z

fwiw I agree the enum approach is better here, just haven't found the time to do it

xokdvium · 2025-11-16T17:12:18Z

just haven't found the time to do it

I have some WIP commits for that. In the meantime I don't see a need to rush. This (not accepting deprecated non-standard aliases) is not a regression.

lovesegfault requested a review from edolstra as a code owner October 29, 2025 17:32

xokdvium requested changes Oct 29, 2025

tomberek approved these changes Nov 10, 2025

xokdvium requested changes Nov 10, 2025
